HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
https://arxiv.org/abs/1809.09600
https://aclanthology.org/D18-1259/
We introduce HotpotQA, a new dataset with 113k Wikipedia-based question-answer pairs with four key features:
(1) the questions require finding and reasoning over multiple supporting documents to answer;
(2) the questions are diverse and not constrained to any pre-existing knowledge bases or knowledge schemas;
(3) we provide sentence-level supporting facts required for reasoning, allowing QA systems to reason with strong supervision and explain the predictions;
(4) we offer a new type of factoid comparison question to test QA systems' ability to extract relevant facts and perform the necessary comparison.
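To make features (1), (3), and (4) concrete, here is a minimal sketch of how sentence-level supporting facts might be resolved against a question's context paragraphs. The record is a toy example invented for illustration. The field names (`context`, `supporting_facts`, `type`) and the `[title, sentence_index]` layout are assumptions about the released JSON files, not something stated in the abstract, and `resolve_supporting_facts` is a hypothetical helper.

```python
from typing import Any, Dict, List, Tuple

# Toy record, invented for illustration. The field names and layout are
# assumed to mirror the released JSON files; check the dataset page to confirm.
example: Dict[str, Any] = {
    "question": "Which tower is taller, Tower A or Tower B?",
    "answer": "Tower A",
    "type": "comparison",  # assumed label for comparison questions
    "context": [
        ["Tower A", ["Tower A is a building.", "It is 300 metres tall."]],
        ["Tower B", ["Tower B is a building.", "It is 250 metres tall."]],
        ["Tower C", ["Tower C is a distractor paragraph."]],
    ],
    # Each supporting fact is a [paragraph title, sentence index] pair.
    "supporting_facts": [["Tower A", 1], ["Tower B", 1]],
}


def resolve_supporting_facts(record: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Map each (title, sentence index) pair to the sentence text it points at."""
    paragraphs = {title: sentences for title, sentences in record["context"]}
    resolved = []
    for title, sent_idx in record["supporting_facts"]:
        sentences = paragraphs.get(title, [])
        # Skip pointers that fall outside the paragraph instead of raising.
        if 0 <= sent_idx < len(sentences):
            resolved.append((title, sentences[sent_idx]))
    return resolved


facts = resolve_supporting_facts(example)
for title, sentence in facts:
    print(f"{title}: {sentence}")
```

In this toy record the two supporting sentences come from different paragraphs, which is what makes the question multi-hop: neither sentence alone is enough to answer the comparison.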
https://hotpotqa.github.io/